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ABSTRACT 

The neutral hydrogen properties of 1,822 galaxies are being studied with the Green 
Bank lOOm and the Parkes 64ni telescopes as part of the 'Cosmic Flows' program. 
Observed parameters include systemic velocities, profile line widths, and integrated 
fluxes. The line width information can be combined with optical and infrared pho- 
tometry to obtain distances. The 1,822 HI observations complement an inventory of 
archives. All told, HI line width information is available for almost all of five samples: 
(i) luminosity-line width correlation calibrators, (ii) zero-point calibrators for the su- 
pernova la scale, (iii) a dense local sample of spiral galaxies with Mks < —21 within 
3,000 km s~^, (iv) a sparser sample of 60 /zm selected galaxies within 6,000 km s~^ 
that provides all-sky coverage of our extended supercluster complex, and (v) an even 
sparser sample of fiat galaxies, extreme edge-on spirals, extending in a volume out to 
12,000 km s~^. The HI information for 13,941 galaxies, whether from the archives or 
acquired as part of the Cosmic Flows observational program, is uniformly re-measured 
and made available through the Extragalactic Distance Database web site. 
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1 INTRODUCTION 

Cosmography is the study of the large scale structure of the 
universe. A complete analysis involves observational and in- 
terpretive components. With spectroscopic information, a 
complete sample of galaxies within specified limits describes 
structure in redshift space. If distance measures are available 
for at least some fraction of the spectroscopic sample then 
steps can be taken to transform to physical space. Distance 
measures allow separation of redshifts into cosmic expansion 
and deviant (or peculiar) components. Cosmological simula- 
tions and orbit reconstructions provide tools for the recovery 
of information about the underlying distribution of matter 
from a map of peculiar velocities. The influence of the dark 
sector on galaxy motions can be studied from ~ 1 Mpc, the 
scale of collapse, to ^ 150 Mpc, the largest scale of useful 
peculiar velocity measures. 

After great enthusiasm for what cou ld be learned from 
peculiar velocity studies in the 1990's (|Willick fc StraussI 



I1998D . (jCourteau et al.l l2000D progress slowed primarily be- 
cause of the challenge presented by the need for much 
more and much better data. If the goal is to have a 
dense grid of distance measures to a depth dominated 
by Hubble expansion then most methodologies for esti- 
mating dist ances have inadequaci es. The Cepheid Period- 
Luminosity (iFreedman et al 1200 ih and Tip of the Red Gi- 
ant Branch (IRizzi et al methods have limited reaches. 
The Surface Brightness Fluctu ation jTonrv et al.l 120011 ) 
and Fundamental Plane (CoUes s et al.ll200ll ') methods ap- 
ply to luminous early type galaxies that are poorly rep- 
resented in low density regi ons. The Type la Supernova 
(|Jha. Riess. fc Khshneij[2007l ) method rests on serendipity, 
resulting in an accurate but sparse map of distances. The one 
well established methodology that can provide decent dis- 
tances with high density over an appropriately large volume 
is provided by the correlation between gal ajcy luminosity 
and r otation rate, the TuUy-Fisher Relation jTullv fc Fished 
ll97'iD . 



* E-mail: h.courtoisSipnl. in2p3.fr 

t The National Radio Astronomy Observatory is a facility of the 
National Science Foundation, operated under cooperative agree- 
ment by Associated Universities, Inc. 



Two observations are needed to apply this method: a 
spectroscopic measure of the rotation rate, most expedi- 
tiously accomplished by observing the line width of a 21 cm 
Neutral Hydrogen profile, and surface photometry at opti- 
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cal or infrared bands that monitor old star populations. New 
capabilities with both spectroscopy and photometry are rev- 
olutionizing our capabilities. On the spectroscopic side, the 
new capabilities are both realized and promised. We will 
describe observations with the 100m Robert C Byrd Green 
Bank Telescope at the National Radio Astronomy Obser- 
vatory (NRAO-GBT) and with the 13 channel Multibeam 
Receiver on the 64m Parkes Telescope. Our program also 
makes use of archival data from ongoing multibeam obser- 
vations with the Arecibo Telescope (Giovanelli et al. 2005). 
The promised capabilities are those that will accrue with 
the wide-field interferometric surveys that will cover the 
entire southern (ASKAP, MeerKAT) and northern hemi- 
spheres (Apertif, LOFAR) with unprecedented sensitivities. 
As for the photometry, the ground technology revolution 
comes from the large multi-band surveys from new wide- 
field CCD camera systems Pan-STARRS in the north and 
SKYMAPPER in the south. Also the satellites Spitzer and 
WISE are providing unprecedentedly accurate surface pho- 
tometry of galaxies in the mid-infrared. 

The data must be compared with analytical or numer- 
ical models. The theoretical tools have developed tremen- 
dously in the last 20 years from small N-body programs to 
extremely large hydrodynamical simulations that have cul- 
minated with su ch data products as the Millennium Sim- 
ulation Project (jSpringel et al.l [20051 ). The advances have 
focused on increasing computational speed, the number of 
particles, and parallelism in the codes. The field is now em- 
bracing higher levels of refinement in the constraints on ini- 
tial conditions, coming from the observations. The theoreti- 
cal universe being built in the new generation of simulations 
must implement observational evidence with a high level of 
detail. In particular on scales of 1-150 Mpc, there is a tension 
between the observed luminous matter distribution and the 
complexity of galaxy fiows, both in directions and in ampli- 
tudes that must be reconciled within th eoretical constructs . 
New developments in analysis methods jZaroubi et al.|[T995l ) 
use information in the gradients and convergence of flows to 
recover mass distribution information that extends beyond 
the observational data. There is an encouraging synergy in 
the advances on both observational and theoretical fronts. 

This conjunction of observational and theoretical 
progress creates a special opportunity for the emergence of 
the 'Cosmic Flows' program that was proposed to NRAO 
as a large program of observations of the HI 21cm line of 
galaxies with the GBT telescope. It is ongoing since 2007 
and already has been scheduled for more than 1,000 hrs 
on the sky. The radio observational program is extended to 
the full sky with access to the most southern targets us- 
ing the ATNF-CSIRO Parkes 64m radiotelescope. An ac- 
companying photometry program is designed to provide the 
surface photometry of the targets. The theoretical program 
accompanying the o bservational progra m is using Numeri- 
cal Action Methods (jPeebles et al.ll2001 ) and numerical con- 
strained simulations: CLUES (jCottlober. S. et al.l[201ol ) . 

The present paper will focus on 1) a presentation of 
the five complementary data samples of the Cosmic Flows 
project, 2) a description of the pipeline developed for the 
consistent measurement of tens of thousands of HI profiles 
(in anticipation of the coming large surveys), and 3) the 
release of the currently accrued radio HI material associated 
with our five samples. 



2 COSMIC FLOWS PROGRAM: FIVE 
GALAXY SAMPLES 

The goal of the Cosmic Flows program is to obtain an 
all-sky grid of galaxy distances as dense and deep as cur- 
rent capabilities allow and to complement the observational 
endeavor with collaborative theoretical studies to try to 
gain a more rigorous understanding of the local dark sec- 
tor (http://www.ifa.hawaii.edu/cosmicflows/). The volume 
of universe that is currently targeted is bounded by the sensi- 
tivity of the GBT and the Parkes single dish radio telescopes 
at a practical limit of 6,000 km s^^ radius. It is expected 
that the next generation of the Cosmic Flows program will 
be testing a volume up to 15,000 km s~^ using wide field in- 
terferometric technology with blind HI all-sky surveys. For 
the moment, competition for telescope time makes it im- 
practical to target even the ^ 7000 appropriately edge-on 
spiral systems with measured redshifts within 6,000 km s~^. 
Our compromise has been to distinguish three discrete sam- 
ples and to strive for a high level of completion with each of 
the three components. One sample, we refer to as "V3K", 
strives to provide a high density mapping of the volume 
within 3000 km s~^. A second sample selected by far in- 
frared fiux, our "PSCz", extends to 6000 km s"^ and gives 
good coverage at low Galactic latitudes. The third sample 
of extreme edge-on systems, "RFGC", provides sparse cov- 
erage over a large volume. Two smaller samples containing 
several hundred targets were observed for calibration pur- 
pose. One provides the slope and zero-point calibrations for 
the TuUy-Fisher (TF) relation. The other provides a scale 
coupling with the Supernova of Type la (SNIa) method. 
These five samples are described in more detail below. 

While our new observations were focused on the five 
samples to be described, our archival searches have been un- 
restricted. We have gathered all available digital HI spectra 
and coherently remeasured the HI profiles for 16,004 spectra 
of 13,941 galaxies, for which 11,074 galaxies have adequate 
measurement to derive distances. Our observations were 
thus targeting only the galaxies within our specified sam- 
ples and without adequate HI profile measurements. These 
limitations have allowed us to essentially complete coverage 
of our samples with about 1,200 observed targets in 3 years. 
As points of comparison, in recent discussions based on lit- 
erature data, TuUy et al. (2008) is based on 1791 distance 
within 3,000 km s~^ including 1252 TF measures, and Feld- 
man et al. (2010) is based on 4536 distan ces dominated by 
the TF measures of (|Springob et aLlbOOSl ). 

Our goal was not merely to incrementally augment the 
literature information. The current procedures involve a new 
definition of the HI linewidth and, as a consequence, required 
new measurements of 100% of the profiles and a complete 
re-calibration of the TuUy-Fisher relation. 

The number of calibrators have increased by more 
than 100% since th e previous extensive calibration of 
(jTullv fc Pierce! I2OOOI ). giving an additional good reason to 
reconsider the calibration. The new procedures and calibra- 
tion will then be applied to the samples of field galaxies. We 
measure the integrated flux of the HI linewidth, and then 
derive the profile width at 50% of this cumulative flux. This 
method requires access to electronic profiles but then enables 
re-computation of all the available profiles in all archives. 
Our procedures have been compared with similar procedures 
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by t he Cornell sroup and with an accurate non-digital sam- 
ple ICourtois et al.ll2009[ ) with satisfactory results. 



2.1 Calibrators sample 

The luminosity-linewidth calibration divides into two parts; 
the first related to the slope of the correlation and the second 
with the determination of the zero-point. 

The determination of the slope of the luminosity- 
linewidth relation is critical for the minimization of sys- 
tematic errors due to one of the two Malmquist biases; the 
effect on errors that can result from maRn itu de- limited sam- 
ples (Malmqui st 1924; (jTeerikorpilll984h ; (fwillick 1994) ; 
iSandagelirggi ) ) Q. A maximum likelihoo d procedure such 
as that employed bv lGiovanelli et all ()l997t ) can be used that 
provides a basis for corrections for bias if there is an accurate 
understandin g of sources of errors . Instead, our procedure, 
discussed bv iTullv fc Piercd (|200(]| ). does not require a de- 
tailed understanding of errors to correct biases but, rather, 
acts to "null" the bias. Individual distance measures have 
uncertainties but are not systematically offset from true dis- 
tances. 

An essential ingredient to the recipe to null bias is 
a slope fit to the luminosity-linewidth correlation that is 
insensitive to magnitude limits. The desired calibration is 
achieved through the compilation of a cluster template cre- 
ated by successively combining data from individual clus- 
ters with offsets in magnitude reflecting their different mod- 
uli. The calibration is achieved with consideration of 370 
galaxies in 13 clusters of diverse morphologies with distances 
ranging from 15 to 100 Mpc. Each cluster contribute 15-70 
galaxies (median of 25) to the template. Each cluster sample 
is magnitude limited and not otherwise limited in a param- 
eter that constrains the fit for a derivation of distances 0. 

1 There is confusion because Malmquist (1920, 1922, 1924) dis- 
cussed possible biases tliat affect galaxy distance measurements 
in two distinct ways. Suppose individual galaxy distances are un- 
biased but have errors. Since there are an increasing number of 
sources at larger distances, a given shell in measured distance will 
be populated by more galaxies with larger true distances scattered 
inward than smaller true distances scattered outward. Lynden- 
Bell et al. (1988) drew attention to this effect in the context of 
galaxy distances and flows and subsequently these authors have 
referred to it as the Eddington-Malmquist bias in recognition of a 
prior contribution by Eddington (1913). The other relevant effect 
is caused by a magnitude limit. For galaxies at the same distance, 
intrinsically fainter systems can be los t from a sample wh ile intrin- 
sically brighter systems are retained. dStrauss fc Williclj[l9 95?) , in 
their extensive discussion of the subject, refer to this latter ef- 
fect as a 'selection bias' and identify 'Malmquist bias' with the 
Eddington-Malmquist effect. In this paper, we will continue to 
use the term Malmquist bias in connection with the magnitude 
selection problem because of the past popular usage and because 
for what we are doing it tends to be the greater concern. 
^ Galaxies with morphological types Sa and earlier are excluded. 
This selection has a qualitative aspect that could introduce a 
small bias. Also, potential candidates are excluded if there is con- 
fusion in the radio telescope FWHM beam or strong evidence of 
tidal disruption. Galaxies are excluded if they are more face-on 
than 45° but this property is not intrinsically correlated with dis- 
tance and tests have not suggested any distance bias with our 
inferred inclinations. 



The absolute zero point calibration of the template re- 
lation is provided by 39 galaxies that pass the same inclina- 
tion, type, and luminosity criteria as the cluster calibrators 
and have accurately known distances from external mea- 
surements. These calibration distances are based on Hubble 
Space Telescope observations of either Cepheid stars or the 
luminosities of stars at the Tip of the Red Giant Branch, 
with the scale set by the HST distanc e scale key project 
(jPreedman et al.l bOOll ). In ITuUv et all l|2008h we showed 
a preliminary result demonstrating that TRGB, SEE, and 
luminosity-line width (TF) distances are all on a consistent 
zero point scale with those established by Cepheid measure- 
ments. The previous zero point calibration in this co Uabo- 
ration was based on 24 galaxies (|Tu11v fc Pierce|[2000l ). so a 
new calibration is possible using 60% more galaxies. 

To complete these calibrator samples, we observed an 
additional 165 galaxies with no previous satisfactory mea- 
surement with the NRAO-GBT Large Program. We now 
have digital data for the entire s ample. The linewidth and 
fiux analysis that was described in lCourtois et al] (|2009l ) was 
performed in a consistent way. The cluster slope calibration 
sample with accurate line widths has reached 326 galaxies, 
an improvement of mor e than 100% compare d to the 155 
galaxies in 12 clusters in lTullv fc Pierce! l|2000l ). 

2.2 SNIa host galaxy sample 

The SNIa host galaxy sample is designed to provide a con- 
fident link between the local scales probed by luminosity- 
line width distances and dominated by peculiar motions and 
the cosmological scales probed by SNIa distances. To built 
this sample we extracted from the literature spiral edge-on 
galaxies suitable for the TE relation and that have hosted a 
SNIa. The SNIa must have a well sampled light-curve and 
an accurate distance measure men t. We draw sample s from 
iJha. Riess. fc Kirshneil |2007l ) and lTonrv et al. liiooi). With 
the GBT observations program we completed the HI obser- 
vations of a sample of 54 galaxies which should provide a ro- 
bust inter-calibration of the luminosity-line width and SNIa 
distance scales. 



2.3 V3K sample 

A special focus on the region within VheUo ~ 3,300 km s~^ 
derives from three considerations. Eirst of all, this velocity 
bounds the structure that historically has been called the 
Local Supercluster (de Vaucouleurs 1953) with the Virgo 
Cluster at its core and the Eornax Cluster as a secondary 
feature well within the boundary. Secondly, with velocity 
measures of 15% accuracy, peculiar velocities can be sep- 
arated from expansion velocities with uncertainties of less 
than a few hundred km s~^ for individual objects and er- 
rors in collective motions of relatively small groups of galax- 
ies can be brought below 100 km s~^. Third, essentially all 
galaxies that might be typed Sb to Sd can be easily detected 
in HI at high signal-to-noise ratio anywhere in the sky if they 
lie within 3,300 km s~^. 

Our rigorous V3K sample is defined by the following 
criteria: 

- VheUo < 3,300 km s-i 

- Mks < -21 
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— inclination from the axial ratio of greater than 45° 

— type later than Sa 

— no pronounce evidence of tidal disruption 

— HI signal not confused from a second galajcy 

The clip in magnitude at Mks = —21 minimizes selec- 
tion problems. The Ks (short K) magnitudes are drawn 
from the 2M ASS Extended G alaxy Catalog and Large 
Galaxy Atlas djarrett et al."2003') assuring uniform coverage 
of the sky relatively unaffected by obscuration down to low 
Galactic latitudes. Absolute magnitudes are derived from 
velocities and a non-parametr ic model of galaxy motions 
jShava. Peebles, fc Tullvlll995l ). Since the faintest galaxies 
have Ks = 12 and are gas-rich spirals the sample is quite 
complete at high latitudes and all candidates are easily de- 
tectable in HI with GET. 

A velocity histogram of the 1,228 galaxies in V3K is 
shown as seen in Figure [1] The original V3K sample con- 
tained 1,228 galaxies. Of these, 993 galaxies have now mea- 
sured HI spectra of adequate quality and, of these, 852 have 
consistently measured digital profiles. The Cosmic Flows ob- 
servations provided 292 new accurate measurements while 
560 accurate measurements could be recovered from the 
re-analysis of archive material. The 135 galaxies that were 
dropped from the initial sample frequently have a compan- 
ion galaxy in the area included in the FWHM beam size of 
the radiotelescope, or an inclination that appeared on in- 
spection to be less than our limit of 45° . 

The region within 3,300 km s~^ is being populated 
by many other galaxies which are not included in the V3k 
sample, yet suitable for the TF relation. The two samples 
that will be discussed next make additional contributions to 
this volume of 639 galaxies (PSCz sample) and 440 galax- 
ies (flat galaxies). Including miscellaneous contributions, we 
presently are exploring the inner 3,300 km s^^ region with 
2,304 TF distance measures. 

2.4 PSCz sample 

We were motivated by two considerations in defining a sam- 
ple that extends beyond V3K. First, we now know that our 
Local Supercluster is just an appendage on a larger super- 
cluster complex that includes the Norma, Centaurus, and 
Hydra clusters as well as several other clusters such as Abell 
3537, 3 565, and 3574, all in the so-called 'Great Attractor' 
region (|Dressler et al.lll987l ). By extending to 6,000 km s~^ 
we encompass the main part of the overdense region that 
includes our Gala xy. It also includes t he adjacent Pisces- 
Perseus filament (|Havnes fc Giovanellil [l988). Our second 
consideration is the current capability of radio-telescopes. 
Satisfactory HI profiles can be obtained for most spiral 
galaxies within 6,000 km s~^ with integrations of less than 
an hour with GET. The Arecibo Telescope is more sensitive 
but accesses only a modest fraction of the sky. The Parkes 
Telescope is less sensitive and longer integrations are some- 
times required but this facility is only required to cover the 
15% of the sky below -45°. 

In an effort to generate an independent sample that 
isolates normal spirals and is uniform around the sky, at- 
tention was given to the Infrared Astronomical Satellite 
(IR AS) Point Source Cata log - Redshift (PSCz) 0.6 Jy sur- 
vey l|Saunders et al.ll2000l ). Extragalactic sources with cool 
far infrared emission (100 fim flux greater than 60 fim flux) 



are typically normal spirals near morphological type Sc. Tar- 
gets can be selected to low Galactic latitudes limited by 
source crowding. We selected an all-sky sample of 1,690 tar- 
gets by the following criteria: 

— Vcmb < 6,000 km s"^ 

— IRAS Seo > 0.6 Jy and Swo/Sm > 1 

— inclination from the axial ratio of greater than 45° 

— type later than Sa 

— not tidally disturbed 

— an HI signal that is not confused 

Adequate HI profiles have been acquired for 1204 galax- 
ies in this sample. The status of observations is shown in Fig- 
ure [T] where the white histogram describes the initial PSCz 
sample, the black histogram describes the totality of sys- 
tems with adequate HI profile measurements, and the grey 
histogram describes the subset of 470 systems with new HI 
information from the Cosmic Flows observations. 

2.5 Flat Galaxy sample 

A dominant source of error in the TF methodology arises 
from inclination measurements. A characteristic uncertainty 
is 5° , a contribution of half the error budget through the cor- 
rection of linewidths toward the face-on limit of 45° . Worse, 
there is a tail of large errors with photometrically derived 
inclinations. The fiat galaxy catalogs compiled from alterna- 
tively optical and near infrared images (RFGC: Karachent- 
sev et al. 1999; 2MFGC : Mitronova et al. 2004) provide 
appealing lists that circumvent this problem. The extreme 
axial ratios of the fiat galaxy sample assures that candi- 
dates are being viewed almost edge-on so uncertainties in 
the de-projection of circular velocities in a disk are negligi- 
ble. With the optically selected component, the axial ratio 
requirement of a major to minor axial ratio greater than 7 
assures that targets have a morphology around class Sc since 
earlier and later types are intrinsically less thin. Obscura- 
tion issues can be minimized by going to the infrared for 
the required photometry. The restriction to extreme edge- 
on limits the coverage density but results in a coherent and 
tractable sample for coverage of a large volume. 

Our GET targets from the Revised Flat Galaxy Catalog 
(RFGC) were restricted to north of 5 = —40° but outside 
the range accessed by Arecibo. Remaining targets within the 
Arecibo range will either be satisfactorily observed with the 
ALFALFA survey or later with pointed observations with 
the Arecibo Telescope. Remaining targets at S < —40° will 
be observed with the Parkes Telescope. 

In one GET semester we were able to complete most 
of the missing observations for RFGC galaxies with known 
radial velocity less than 6,000 km s~^. Some targets 
were added to fill in the allocated time windows drawn 
fro m the 2MFGC (2M ASS-selected Flat galaxy Catalog) 
( Mi tronova et al] |2004| ) , or from a list of nearby galaxies 
for which no modern digital spectrum is available. A tiny 
fraction of potential targets were a priori discarded, since 
RFGC galaxies are usually isolated, thus there were a very 
low 2% rate of confusion in the radiotlescope beam or gross 
distortion of the candidate from tidal disruption. The non- 
detection rate weis of 14%: 83 galaxies of 577 were not de- 
tected at our fiux limit. More accurately 12% of the RGFC 
and 23% of the 2MFGC galaxies were not detected. The 
high detection rate is inherent to the gas-rich nature of the 
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Figure 1. The original Cosmic Flows samples from top to bottom: V3K, PSCz and Flat Galaxies (white histograms) compared to 
the total of accurate linewidth measurements we have measured (black histograms) and to our GBT and Parkes pointed observations 
(grey histograms). All our samples are now completed in HI observations apart from the region of the sky better observed with Arecibo 
and Parkes. Some galaxies were dropped from the original sample usually because of beam confusion or inadequate inclination. In the 
right panels, histograms illustrate the distributions of source integrated fluxes in HI. The galaxies in the PSCz and RFGC samples are 
progressively more distant on average than the V3k galaxies so are progressively fainter in apparent flux. 



Flat Galaxy sample, while 2MFGC is biased toward 2MASS 
earlier types galaxies. 

The RFGC original sample contains 4,444 galaxies, for 
which 2,788 have a published redshift (white histogram of 
the bottom left panel of Figure [T] Amongst these 2,788 we 
now have an accurate line width measurement for 1,229 
(black histogram). The Cosmic Flows observations provided 
323 (white histogram) new accurate measurements. 

Two figures summarize the properties of the various 



samples. Figure [2] gives combined histograms of the samples. 
There are large numbers of sources within the formal sam- 
ples and large numbers of additional sources with good HI 
profile measurements within 3,000 km s~^. At 3, 000 — 6, 000 
km s~^, there are comparable numbers of sources to the in- 
ner 3,000 km s~^ though spread through a larger volume. 
Beyond, there is a tail of rapidly decreasing coverage that ex- 
tends to ~ 12, 000 km s~^. The distribution of many of these 
sources can be seen in the two slices of a three-dimensional 
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Figure 2. Histogram of sample components as a function of 
velocity. The V3K sample is illustrated by the filled histogram, 
the component of the PSCz sample at 3,000-6,000 km s~^ is illus- 
trated by the open hashed histogram, and the fraction of the Flat 
Galaxy sample with satisfactory HI profile information is illus- 
trated by the denser hashed histogram. The cumulative samples 
are outlined by the dotted histogram. The open solid histogram 
describes the current accumulation of sources with adequate HI 
profile information. 



cube in Figure [31 It is seen that the nested samples provide 
good coverage of our immediate over-dense region but falls 
off rapidly beyond the edges of the structure that we live in. 



3 OBSERVATIONS WITH GREEN BANK 
TELESCOPE 

To strive for completion of the V3K, PSCz, and Flat Galaxy 
samples, we have been observing with the 100m Green Bank 
Telescope at declinations above 5 = —45°, the southern 
limit with GET, but excluding the Arecibo range < 
(5 < +38 where data is currently acquired within the AL- 
FALFA project. Our present strategy is to await the results 
of the ALFALFA multibeam survey currently in progress. 
Currently ALFALFA data releases 1 — 3 have been pro- 
cessed through our pipeline. If profiles are inadequate we 
will request deeper observations of those targets in the 
Arecibo range of declinations. Access to the remaining sky, 
at (5 < —45°, requires observations with the Parkes Tele- 
scope in Australia. Sources are not observed if digital spec- 
tra of good quality exist. 

The single-beam Robert C. Byrd Green Bank Telescope 
(GBT) observations released in this paper are identified as 
ctf2009 and ctm2010 in our "All Digital HI catalog". Ob- 
servations were carried out spanning from February 2007 to 
June 2010 with project identifiers: 07A039 (55 hrs), 07C067 
(218 hrs), 08A072 (47 hrs), 08B041 (50 hrs), 08C010 (340 
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Figure 3. Two slices in supergalactic coordinates that illustrate 
the coverage of the various samples. Top: A 3,000 km s~^ thick 
slice in SGY that shows the main structure of the Norma-Hydra- 
Centaurus- Virgo supercluster complex. Bottom: A 5,000 km s~^ 
thick slice in SGZ that includes the main features of our superclus- 
ter complex and includes the Pisces-Perseus structure at negative 
SGY and positive SGX. All known galaxies from a redshift cata- 
log are plotted in grey. The V3K sample is over-plotted in cyan; 
the PSCz sample is over-plotted in orange; the Flat Galaxy sam- 
ple is over-plotted in green; and the SNIa sample is over-plotted 
in red in larger symbols. 



hrs), and 10A059 (370 hrs). The total of observing time al- 
located to the project at GBT was 1080 hrs. GBT has im- 
plemented a "Legacy ID" numbering for the public access of 
data, our project data can thus been recovered under these 
Legacy ID: GF13, GC47, GC60, GC67, GC69, GC102. 

For the observations we use the single beam L band 
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(1 to 2 GHz) receiver and the spectral line spectrometer as 
the backend detector. The final spectrum is stored with 1.6 
km resolution. It was usually binned at least once to 3.2 
km s~^ resolution for the HI linewidth measurement. 



4 OBSERVATIONS WITH THE PARKES 
TELESCOPE 

Southern targets S < -45 deg for the two samples V3K and 
PSCz are currently being observed with the Parkes Tele- 
scope. 

The Parkes 64m radiotelescope observations released in 
this paper were carried out in February 2009, 1-10: P660: 
60 hrs on the sky. We were able to observe 58 galaxies with 
known radial velocity less than 4,000 km s~^. We obtained 
33 spectra leading to accurate linewidths measurements and 
24 are inadequate for distance measurements. A typical ex- 
posure time for a target at 3,000 km s~^ was 1.5 hour or 
more. The non-detection rate was 16%: 9 galaxies of 58 were 
not detected. 

Figure |4] illustrates the general properties of the HI pro- 
files that were adequately detected in the course of the Cos- 
mic Flows Large Program with GET and the supplement 
with Parkes Telescope. 



5 ANALYSIS OF HI LINE PROFILES 

With the award of our project as an NRAO GET Large 
Program, we reconsidered the traditional ways of measuring 
the HI linewidth. We were looking for a robust method that 
could be applied to both low and high signal-to-noise ratio 
spectra. We also wanted to be able to remeasure a large 
quantity of archive HI data from various telescopes, so the 
method would be as automated as possible. 

Historically, Tully and Fisher measured linew idths W20 
at 20% of peak intensity (|Tu11v fc Fished 1 197?! '). However 
the 20% level is frequently close to the noise level leading 
to spurious measurements. Also the 20% level is difficult to 
secure when profiles are strongly asymmetric. We now pre- 
fer to measure the width enclosing 50% of the cumulative 
HI line flux, Wmso- Specifically, Wmso is the line width at 
a flux level that is 50% of the mean flux averaged in chan- 
nels within the wavelength range enclosing 90% of the total 
integrated flux. The exclusion of 5% of the flux at the high 
and low wavelength edges minimizes problems in the profile 
wings. The Wmso line widths are measured at slightly higher 
flux levels than W2o- The random scatter between these two 
measures is the lowest found among alternatives that were 
examined. 

The parameter Wmso is an empirical measure of the 
width of an HI profile. We correct for redshift and instru- 
mental broadening with the formula 

W^™50 = ^-2A^A (1) 
r -7- z 

where z is the redshift, Ad is the smoothed spectral reso- 
lution, and A = 0.25 is an empirically determined constant. 
The observed line width can also be adjusted by separating 
out the broadening from turbulent motions and offsetting to 
produce an approximation to 2Vmax where Vmax character- 
izes the rotation rate over the main body of a galaxy. We 
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Figure 4. Histograms of the galaxy properties of the 1,423 suit- 
able profiles obtained in the Cosmic Flows Program in HI at GBT 
and Parkes: (a) heliocentric recession velocity in km s~^; (b) HI 
linewidth at half power (WmSO); (c) logarithm integrated flux ; 
(d) logarithm of the signal-to-noise ratio. 
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have defined the parameter where 

-2H/™5om,™5o[l - e-e^"^"''^'^-"™)'] (2) 

with Wc,m50 = 100 km and Wt,m50 = 9 km s"\ Then 
= Wmi/sin(z) where i is the galaxy inchnation from 
face on. Details regarding the Wm50 and W^^ line width 
pa rameters and compari sons with alternatives are discussed 
bv lCourtois et al.l (|2009l ). 

All modern spectra are available in a digital form. Con- 
sequently it is straight forward for us to pass archival data 
through the same analysis procedure as we apply to our 
newly observed material. It, of course, means that we do 
not need to observe a galaxy if we can access adequate ob- 
servations from any archive. The primary sources of data 
in our current catalog originate, by decreasing number, 
from Arecibo (7,898), Nangay (3439), GBT (1,444), Parkes 
(1052), the old NRAO 300' (1,059), the 140' (696) and Ef- 
felsberg (235). 

The assignment of errors for Wmso or quality flags is a 
particularly challenging problem. Formally, we link errors to 
the pr ofile signal to rms noise as described bv lCourtois et all 
l|2009h . From comparisons between alternative observations 
our cited errors appear to be conservative; about 50% larger 
than a la value. Also on the side of being conservative, we 
assume that the line width uncertainty is 8 km s~^ in the 
best of cases. Estimates are difficult to quantify precisely 
because the error on Wmso is linked to three characteristics: 
signal-to-noise ratio, observational flux limit, and shape of 
the HI line with flux in the wings that may or may not be 
real. Our interest is to use the profile widths as a parameter 
in the measurement of distances. Even if our errors do not 
have a precise absolute sense they do have a relative sense. 
Within our system, we consider that there is a threshold 
of acceptability: a profile may be of sufficient quality to be 
used in the determination of a distance, or it may not be. 
We link our error estimate to this threshold. Specifically, an 
adequate profile is assigned an error of less than or equal 
to 20 km s~^. Inadequate profiles are identified by errors 
greater than 20 km s~^ (see examples in Figure [S]). Con- 
fused profiles are identified by the error fiag of 100 km s^^. 
Non-detections are identified by the error fiag of 500 km s~^ . 
Since for distance measurement purposes we will not use any 
profile with an error larger than 20 km s"'^, measurements 
with such large errors are not taken into account in the av- 
eraged WmxSM column of our 'AH Digital HI Catalog'. Thus 
if a galaxy has only one measurement and it is considered 
inadequate then there will be a value given for Wmso among 
the parameters measured from the specific source profile but 
no value will be registered for Wmx-S-v, see for example line 
2 of Table □ 

In our compilation of digital spectra, some galaxies have 
been observed by 2 or 3 different telescopes. Inter compar- 
isons between different HI observations of the same galaxies 
suggest that the characteristic accuracy of an individual ac- 
ceptable profile width is 7 km s~^. 

Our HI observations at Green Bank and Parkes 64m 
telescopes are made available in an electronic form with this 
article. A few lines are given in Table [1] as example. The 
Columns are respectively: PGC name, source of observa- 



tions, telescope, heliocentric velocity, Wmso, error on Wmso, 
signal-to-noise ratio, integrated fiux in the line. 



6 LINKING ANALOG AND DIGITAL LINE 
WIDTH MEASUREMENTS 

Even though there are now digital spectra for most galaxies 
the collection remains incomplete. Indeed, it will be diffi- 
cult to achieve completion. Some very nearby galaxies are 
much larger than the FWHM beam sizes of modern radio 
telescopes. They could be mapped. Better, integrated pro- 
files can be reconstructed from observations with the West- 
erbork. Very Large Array, or Australia Telescope interfer- 
ometers. Alternatively, profiles obtained in earlier days with 
smaller telescopes like the NRAO 140' or the Dwingeloo fa- 
cilities might be the best available. 

A considerable effort was made in earlier years to accu- 
mulate a consistent compilation of analog line widths. The 
contributing observations were made with many telescopes 
and reported by many sources but all W20 measures in our 
compilation were coherently measured by only three people: 
J.R. Fisher, R.B. TuUy, or C. Hall. 

A comparison was made between an alog W20 mea- 
surcs and our new digital Wmbo measures bv lCourtois et akl 
( 2009 ). In applications related to distance determinations 
we will be more interested in the line width parameters re- 
lated to the physical property Vmax, the characteristic ro- 
tation velocity across the disk of a galaxy. To this end, we 
now compare the parameters that approximate 2VmaxSmi 
for the separate analog and digital samples. This comparison 
is shown with Figure [B] The analog parameter Wr is calcu- 
lated here using a constant value Wt,20 = 22 km s~^ in the 
equivalent to Eq.[2]for the case of the analog transformation 
using 1^20- This value for the thermal broadening constant is 
differe nt from the value W°^A ) = 38 km s~^ originally advo- 
cated ||Tu11v fc Fouaue|[l985l ) for the transformation to Wr . 
For a discussion of this change see ICourtois et al.l (|2009l ) . 
Based on 1755 galaxies with good analog and digital pro- 
files, we find a relation that allows us to transform from 
analog to digital parameters: 

M/^o. = 1.015Wj? - 11.25 (3) 

The r.m.s. scatter is 10 km s~^ suggesting uncertainties of 
roughly 7 km s"'^ in each of the measured parameters. 

It is seen that there is a slight slope to the difference 
between the alternative line width measures Wr and Wmx 
which in principal are both approximating 2Vmaa:Sini. The 
two measures are essentially equal at large line widths but 
there is a mean difference that grows to 10 km s~^ for the 
smallest line widths. Because of this systematic between Wr 
and Wmx it is essential that the relation between galaxy lu- 
minosity and line width (the TF relation) be calibrated for 
the specific line width measure that is used. Whatever the 
line width measure and to whatever degree it may approx- 
imate a physical parameter, it remains subject to observa- 
tional vagaries. 
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Figure 5. Example of two adequate profiles (top and middle) and one inadequate (bottom). Profiles are suitable for distance measurement 
using the Tully-Fisher Relation when an error on WmSO was set to 20 km s~^or less. The corresponding flux integration curves are 
displayed on the left with the 5% and 95% window (blue vertical lines) for the WmSO measurement. These profiles are showing an 
icreasing error from top to bottom. 



7 HI CATALOG IN EXTRAGALACTIC 
DISTANCE DATABASE(EDD) 

Our new observations from the Cosmic Flows program and 
archival information are combined. Tabular information and 
hne profile plots are provided for 13,941 galaxies in the All 
Digital HI Catalog (ADHI) in EDD. Intensity-velocity ascii 



tables are provided in the cases of material from the Cos- 
mic Flows observational program. Currently the catalog con- 
tains 16,004 HI profiles, including 1,859 new profiles added 
through this program. There are 11,074 profiles that are 
deemed acceptable; ie, with uncertainties ^ 20 km s~^. For 
1339 galaxies there are at least 2 acceptable profiles and in 
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Table 1. 1,822 Cosmic Flows Green Bank and Parkes HI observations 



PGC 
name 


source 
code 


telescope 


km/s 


Wm50 

km/s 


error on Wmso 
km/s 


signal-to-noise 
ratio 


integrated flux 
Jy.km/s 


PGC0000094 


ctm2010 


GBT 


4098 


188 


14 


8.9 


3.6 


PGC0000207 


ctf2009 


GET 


6597 




25 


2.6 


1.9 


PGC0000218 


ctf2009 


AGG 


1051 


453 


8 


26.2 


11.7 


PGC0000279 


ctm2010 


GBT 


2311 


411 


11 


12.4 


14.0 


PGC0003743 


ctk2010 


PAKS 


2310 


361 


17 


5.0 


13.5 




O.OlSWu 



800 



W, (km/s) 

Figure 6. Comparison of the line width measures designed to ap- 
proximate 2Vmax with alternatively the old analog measurements 
and the new digital measurements of line width, respectively Wji 
derived from W2Q and Wmx derived from Wm.50- Globally there 
is an offset in the difference of the two parameters of 8 km s~^ 
with a dispersion of 11 km s^^. In detail there is a small gradient 
in the difference described by the solid line. 



82 cases there are three acceptable profiles. Additions are 
continually being made to the catalog. 

Figure [7| provides a graphic summary of aspects of the 
All Digital HI Catalog. The redshift distribution in the top 
left panel contains peaks at 1,800 km s~^ and 5,000 km s~^ 
. The two peaks are partially due to the development of the 
sample with the increasing capabilities of telescopes, with 
an early emphasis on the region within 3,000 km s~^, and 
partially a refiection of the distribution of nearby structure. 

In the top right panel it is seen that there is a tail to 
the line width distribution that extends to 1,000 km s~^. 
The galax;ies that contribute to this tail are of sufficient in- 
terest that we will discuss them in a later publication. It is 
seen in the lower left panel that galaxies with a full range 
of line widths are seen over a wide redshift range. Within 
3,000 km s~^ there is greater representation of narrow line 
width systems as a consequence of sample selection. 

Since our full catalog is a compendium of many sources 
there are artifacts of the collection. In Figure [8] we plot the 



Table 2. Radio- Telescopes in EDD (Extragalactic Distance 
Database) 



Telescope 


Aperture 


Beam 


EDD 


EDD 




meters 


arcmin 


Acronyms 


spectra 


Arecibo 


305 


3 


AOG-AOlf-ALFA 


7,898 


Nangay 


200x40 


4x22 


Nanc 


3,439 


GBT 


100 


9 


GBT 


1,444 


GB300 


91 


10 


GB300 


1,059 


Parkes 


64 


14.5 


PAKS 


1,052 


GB140 


43 


21 


GB140 


696 


Effelsberg 


100 


9 


Efll's-Eff 


235 



logarithm of the integrated flux vs. the logarithm of the 
linewidth. The top panel shows that with our Cosmic Flows 
observations large line width targets are as equally well rep- 
resented as small line width targets at the faint limit. This 
situation arises because our integration times are established 
to be long enough to acquire adequate profiles case by case. 
By contrast, in the middle panel it is seen that there is a 
strong dependence of the faint limi t in the sample of the 
1000 brightest galaxies in HIPASS (jKoribalski et al.ll2004l ) 
where integration times are fixed. A similar trend though 
at a lower flux leve l is found in the A LFALFA survey af- 
ter 40% completion (jMartin et al.ll2010l l. Blind surveys with 
fixed integrations tend to miss low flux/high rotational ve- 
locity galaxies. The bottom panel shows this trend is not 
severe for the ensemble of our All Digital HI catalog, since 
it is comprised at 85% from targeted observations. For com- 
parison we over-plot the detection limits for HIPASS (red 
upper line) and ALFALFA 40% (green dashed line). 

Figure [To] shows the distribution of the detected masses 
with distances. The lower green solid line is the ALFALFA 
5 a sensitivity at 5.6 Jy.km s~^and the upper red line is the 
HIPASS 5 a detection limit at 0.372 Jy.km s"\ 

HI mass is computed using the integrated flux and the 
redshift of the galaxies using: 

Mhi = 2.36 X IO^Dmpc x FI (4) 

where Dmpc is the distance, and FI is the integrated flux in 
Jy.km s~^. For consistency in a comparison with ALFALFA 
velocities are converted to distance assuming Ho — 70 km 
s~^ Mpc~^. Figure [9] shows the distribution of HI Mass in 
our all sky catalog. It is com parable wi th the ALFALFA 40% 
HI Mass function from (jMartin et al.l[2010. ) displayed with 
the green line. The cutoff at high HI mass is abrupt above 
3 X 10^° Mq. The interesting objects in the high HI mass 
tail will be discussed in a separate publication along with 
the objects in the high line width tail. 
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Figure 7. Histograms of the galaxy properties of the 11,051 galaxies with adequate HI profile for distance purpose which have been 
consistently analyzed by the Cosmic Flows project: distribution in heliocentric recession velocity in km s~^; distribution in HI linewidth 
at half power {Wm5o)\ WmSO function of heliocentric velocity; distribution of the logarithm of the integrated flux. 



Table 3. All Digital HI Catalog sources 



Code 



Literature source 



ksk2004 Koribalski, Staveley-Smith, et al. 2004 

shg2005 Springob, Haynes, Giovanelli, Kent 2005 

hkk2005 Huchtmeier, Karachentsev, et al. 2005 

tmc2006 Theureau, Martin, Cognard, et al. 2007 

ghk2007 Giovanelli, Haynes, Kent, et al. 2007 

sgh2008 Saintonge, Giovanelli, Haynes, et al. 2008 

kgh2008 Kent, Giovanelli, Haynes, et al. 2008 

ctf2009 Courtois, TuUy, Fisher, et al. 2009 
ctm2010-ctk2010 Courtois, et al. (this paper) 



8 SUMMARY 

Over the years, many observers at the world's largest tele- 
scopes have acquired useful information about the neutral 
hydrogen properties of spiral galaxies. Our primary concern 
is to measure distances by exploiting the correlation between 
galaxy HI profile line widths and luminosities, the TF rela- 
tion. Five samples have been defined: one to calibrate the 
TF relation with newly defined line width parameters and 
photometry, another to assure uniformity with the SNIa dis- 
tance scale, and three more that provide all-sky coverage 
to different depths, with different densities, and with dis- 



tinct selection criteria. Observations with the Green Bank 
and the Parkes telescopes within the Cosmic Flows program 
have built upon the body of archival material to the degree 
that adequate HI profile information now exists for almost 
all the galaxies in our five samples. 

Tabular and graphical information on the HI proper- 
ties of galaxies, whether from new observations or from the 
archives, is gathered and made available at the Extragalac- 
tic Distance Database: ^http: / / edd.ifa.hawaii.edu j select the 
catalog 'AU Digital HI'. 
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Figure 8. The distribution of sources detected with an adequate 
profile and an adequate signal-to-noise ratio: (a) our GBT and 
Parkes observations, (b) 1000 brightest HIPASS profiles, (c) the 
All Digital HI Catalog with overploted the detection limitation of 
HIPASS (green upper line) and ALFALFA 40% (red lower line). 
Our all sky catalog, which is mainly composed of targeted obser- 
vations, doesn't show the detection bias trend of blind surveys. 
Blind surveys tends to not detect the low flux/high rotational 
velocity galaxies. 



Figure 9. Histogram of the distribution of HI masses of 11,051 
galaxies with good profiles, plotted as logarithm of the HI mass 
in solar units. The results from 40% ALFALFA are shown in red 
jMartin et al.ll2010h . 
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Figure 10. The 11,051 accurate profiles plotted as 
log{MHi/M(T)) vs. distance in Mpc. The adopted value to com- 
pute the distance here is Hq = 70 km.s^^.Mpc"'^. The lower red 
solid line is the ALFALFA 5 cr sensitivity and the upper green 
line is the HIPASS detection limit. 
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